Bush-Mosteller learning for a zero-sum repeated game with random pay-offs
نویسندگان
چکیده
This paper deals with the design and analysis of a modijied version of the BushMosteller reiqfimement scheme applied by partners in a zero-sum repeuted grme with random pay-offs. The suggested study is based on the learning automata paradigm and a limiting average reward criterion is tackled to analyse the arising Nash equilibrium. No information concerning the distribution qf the pay-off is a priori available. The noveltjl of the suggested adaptive strategj3 is related to the incorporation qf a 'normalization procedure' into the standard Bu.sh-Mostcller scheme to provide a po.r.sibilitj, to operate not only with binary but also ~ , i t h unj3 hounded rewards of a stochastic nature. The analysis of the convergerre (adaptation) us well as the convergence rate (rute of adaptation) are presented and the optimal design parumetcrs of this adaptive procedure are derived. The obtained adaptation rute turns out to be of o(n-'I3).
منابع مشابه
A More General Model of Cooperation Based on Reinforcement Learning: Alignment and Integration of the Bush-mosteller and the Stochastic Collusion and the Power Law of Learning: Aligning and Integrating the Bush-mosteller and the Roth-erev Reinforcement Learning Models of Cooperation
Analytical game theory has developed the Nash equilibrium as theoretical tool for the analysis of cooperation and conflicts in interdependent decision making. Indeterminacy and demanding rationality assumptions of the Nash equilibrium have led cognitive game theorists to explore learning-theoretic models of behavior. Two prominent examples are the Bush-Mosteller stochastic learning model and th...
متن کاملLearning through reinforcement for N-person repeated constrained games
The design and analysis of an adaptive strategy for N-person averaged constrained stochastic repeated game are addressed. Each player is modeled by a stochastic variable-structure learning automaton. Some constraints are imposed on some functions of the probabilities governing the selection of the player's actions. After each stage, the payoff to each player as well as the constraints are rando...
متن کاملMultiple attribute decision making with triangular intuitionistic fuzzy numbers based on zero-sum game approach
For many decision problems with uncertainty, triangular intuitionistic fuzzy number is a useful tool in expressing ill-known quantities. This paper develops a novel decision method based on zero-sum game for multiple attribute decision making problems where the attribute values take the form of triangular intuitionistic fuzzy numbers and the attribute weights are unknown. First, a new value ind...
متن کاملA TRANSITION FROM TWO-PERSON ZERO-SUM GAMES TO COOPERATIVE GAMES WITH FUZZY PAYOFFS
In this paper, we deal with games with fuzzy payoffs. We proved that players who are playing a zero-sum game with fuzzy payoffs against Nature are able to increase their joint payoff, and hence their individual payoffs by cooperating. It is shown that, a cooperative game with the fuzzy characteristic function can be constructed via the optimal game values of the zero-sum games with fuzzy payoff...
متن کاملReinforcement learning account of network reciprocity
Evolutionary game theory predicts that cooperation in social dilemma games is promoted when agents are connected as a network. However, when networks are fixed over time, humans do not necessarily show enhanced mutual cooperation. Here we show that reinforcement learning (specifically, the so-called Bush-Mosteller model) approximately explains the experimentally observed network reciprocity and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Int. J. Systems Science
دوره 32 شماره
صفحات -
تاریخ انتشار 2001